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Abstract 

An adaptive control algorithm is proposed for a class of nonlinear systems, such as 
robotic manipulators, which is capable of improving its performance in repetitive motions. 
When the task is repeated, the error between the desired trajectory and that of the system is 
guaranteed to decrease. The design is based on the combination of a direct adaptive control 
and a learning process. This method does not require any knowledge of the dynamic 
parameters of the system. 

1. Introduction 

The position servo control is an important and basic problem in the successful 
operation of robot manipulators. Many methodologies regarding the solution of this 
problem have appeared in the literature. Recently, the interest in adaptive control of robot 
manipulators has been growing noticeably [1,3]. This growth is mainly due to the fact that 
unlike the non-adaptive control methods, the adaptive control strategies do not require the 
explicit knowledge of robot dynamics parameters. 

On the other hand, recently some works have been reported for the generation of the 
controlling input for the repetitive motion of dynamical systems [4,6]. These are called 
learning controllers because the control input generated this way is improved through 
repeated trials. The method proposed in [4] requires the derivative of the error function in 
the learning process to guarantee the uniform convergence. Moreover, the conditions on 
the system's transfer function is very restrictive, and it also requires that the system's 
inverse dynamics be proper and stable. In [5], the concept of a dual system is used for the 
recursive generation of the input. This result is mainly applied to linear, time-invariant 
systems and the conditions for it's convergence are not restrictive. However, the 
construction of the dual system requires the knowledge of the original system, and it is also 
not practical. In [6], an adaptive learning controller is designed which can be applied to 
robotic manipulators and can be easily implemented. 

In this paper the concept of learning control is applied to the problem of model 
reference adaptive control of manipulators. Our objective is to design an adaptive 
controller capable of learning to improve its performance in repetitive motions. The 
approach is similar to that used in [6], with a slight modification where the concept of 
inner-loop/outer-loop control is used. The proposed adaptive controller can be applied to a 
robot manipulator in non-repetitive motion, in which case it performs as a standard model 
reference adaptive controller. But when it is commanded to perform a task repeatedly, the 
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learning controller improves its performance in subsequent motions. 

We first develop a model reference adaptive control strategy [1,2] to the robot 
manipulator so that the resulting closed-loop system is equivalent to the preselected 
reference model. The model reference adaptive controller designed in [1] is shown to force 
the robot dynamics to follow those of a predetermined linear time-invariant reference 
model very closely. 

Then we develop a new learning controller for a linear time-invariant system L, with 
guaranteed convergence under very mild conditions. This is similar to the learning 
controller proposed in [6], which is based on the existance of an auxiliary system L* such 
that its composition with the original system (i.e., LL*), is positive real. It was shown in [6] 
that such an auxiliary system can always be found if the original system is stable. 

Finally, we apply our learning strategy, as an outer-loop control, to the linearized 
inner-loop system resulted from applying the model reference adaptive controller to the 
robot. The proposed control system is shown in Figure 1 . The overall closed-loop system is 
shown to be asymptotically stable, and does not require any knowledge about the dynamic 
parameters of the robot. The error between the desired response and the actual robot 
response is guaranteed to approach zero after executing the desired task repeatedly. 



Outer Loop 


Figure 1 


2. Robot Dynamics 

The dynamic equation of a robot manipulator is highly nonlinear and is given by 

M(q)q" + H(q,q')q' + g(q) = u (1) 

where q(t) is the nxl vector of joint angles, M(q) is the nxn symmetric, positive definite 
inertia matrix, H(q,q')q‘ is the nxl Coriolis and centrifuged force vector, g(q) is the nxl 
gravitational force vector, and u is the nxl applied torque vector for the joint actuators. It 
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can easily be shown that the above equation can also be written as 


( 2 ) 


x' = A(x) x + B(x) u 
y = C x 

where 


x(t) = 

q 

_q'_ 

r~ 




A(x) = 

0 I 

_Aq(x) Ai(x) _ 

, B(x) = 

0 

_Bq(x)_ 


C = [Ci , C 2 ] 

for some Ci, C 2 , such that x(t) is 2nxl state vector, u(t) is the nxl input torque vector and 
A 0 = M-lG, Ai=-lH, Bo=M*l and g=Gq. 

3. Inner-Loop Adaptive Control 

Consider the robotic system, given by equation (2). Let the dynamic equation of a 
reference model be given by a minimal, 2n-th order, stable linear system 

x m = x m + r ^ 

7m = C m x m 

where x m is the 2nxl model state vector, u is the nxl reference model's input vector, 


Am = 

0 I 

> B m - 

0 


_A m 0 A m l 


_BmO _ 


C m = C = [C 1 ,C 2 ]. 


Now let us define the state error to be 

e = x m - x . (4) 

Then the dynamic equation of the state error is given by 


e — A e + (Aju ■ A) Bj^ r ■ h - B u 

e =Ce 


( 5 ) 



where h is assumed t o be any unmodeled disturbances not already considered in the robot 
dynamics, and e = y m - y is the output error. 

The problem is to design an adaptive controller so that error equation (5) is asymptotically 
stable, where the state error e, and hence the output error e, approach zero as time 
increases. 

Result 1 


Consider the robot manipulator given by the dynamic equation (2), and the linear, 
time-invariant stable, and controllable reference model, given by equation (3). Let the 
adaptive controller be given by 

u = K e e + K x x m + K r r + z (6) 

with the adaptation law 

K’ e = <xj E P e ej + c *2 d/dt(E P e ej) (7) 

K' x = E P e x m T + £2 d/dt(E P e x m ^) 

K' r = Jfj E P e rT + * 2 d/dt(E P e rT) 
z' = tuEPe + r| 2 d/dt(EPe) 


where pj, r|j > 0, E = [0,1], and P and Q are symmetric positive definite matrices 
such that for some stable matrix D of the designer’s choice, we have 

PD + D T P = -Q. (8) 

The above adaptive controller results in an asymptotically stable system such that the 
dynamics of the closed-loop robotic system follow that of the linear reference model. That is, 
the state error e = x m - x, and hence the output error e = y m - y, approach zero as time 

increases. 

Proof 


The proof uses Liapunov’s direct method for stability, and can be found in [1], □ 

The above result indicates that using the proposed adaptive control, the dynamics of the 
nonlinear robot in the closed-loop matches that of the predetermined, stable, linear, 
time-invariant reference model. Therefore, the inner-loop control in Figure 1 can be 
assumed to approximately have the same dynamics as the linear reference model. 
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4. Outer-Loop Learning Control 


Let us consider a linear, stable, time-invariant dynamical system, given by 

x’ = A x + B u (9) 

y =Cx 

where x is the n-dimensional state vector, u is the m-dimensional input vector, and y is the 
m-dimensional output vector. The above system can also be denoted by the linear operator L, 
where y = Lu. 

Now let Y<i(t) be the desired output function (trajectory) of the system over the interval 
[0,T], with the initial state x(0) = xg. Assume that u^(t) and yfc(t) are the corresponding input 
and output functions of system (9) over the time interval [0,T] in trial k, with the initial state 
xg. Then the learning control strategy is the updating rule that generates the input function 

uk+l(t) for the interval [0,T] from the knowledge of u^ft) and the error efc(t) = yd(t) - y]j(t). 
This is then applied to system (9) with the same initial state xg at trial k+1 to drive the error 
function e^(t) to zero. 

Let us define the norm and the inner-product to be given by 

<xj , X2> = J 0 T X1 T(t) X2(t) dt 

llxll 2 = JgT x T (t)x(t)dt 

Then the auxiliary system is defined to be a linear, stable, time-invariant system, given by 

z' = Fz + Ge (10) 

v = H z + E e 

or equivalently by the operator L*, as 
v=L* e 

such that LL* is a positive real operator. That is 


<x , LL* x> > 0, for all x . (11) 

Now let the dynamics of a reference trajectory be given by the linear operator L such that 


y r = L s 

(12a) 

or equivalently 


x j* - Aj* Xj* 4“ B r s 

(12b) 

II 

Q 

X 
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where s=q" r , and 


x r - 


qr 


q r 


A(x) = 

0 

I 

. B(x) = 

0 


0 

0 


I 

C r = ' 

c = [Cl 

c 2 ]. 



Utilizing the definition of auxiliary system (10), we can show the following result. 

Result 2 

Consider the linear, time-invariant minimal, and stable reference system (12), 
denoted by operator L. Suppose that a linear auxiliary system, denoted by operator L*, is 
such that the operator LL* is positive real. Let the learning control strategy for trial k+1 be 
given by 

sk+l(t) = sfc(t) + ajj vfc(t) (13) 

with S0(t)=q" r (t), where 

a k = < e kW' LL * e k( t ) > / 1 III* ek(0ll 2 

e k( t ) = yrW-yk( t ) 

and vk(t) is the output of the linear operator L*, i.e., vfc = L* efc, with zero initial values. 
The above learning controller is convergent in the sense that efc(t) vanishes over the 
interval [0,T] as the number of trials k increases. 

Proof 

Consider the linear system (12) with input (13). Now let a discrete Liapunov candidate 
be given by 

J k = IbkWlI 

where e^U) = y r (t) - yk(t) is the error at trial k. Then we have 
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d k+l ' d k 

= ||ek + l(t)|| 2 -|lek( t )H 2 
= llek(t) - afc L vk(t)|| 2 - llekCt)!! 2 
= a k 2 ||Lvk(t)|| 2 - 2 a k <e k (t) , L v k (t)> 

= a k 2 ||L L* e k (t)|| 2 - 2 a k ||e k (t) , L L* e k (t)|| 2 . 

Now taking a k = <e k , L L* / ||L L* erf, it is easy to check that AJ is minimized with 

respect to a k , and hence AJ k < 0. Therefore, from the discrete Liapunov method, the 
learning controller (13) guarantees that the error function ejj(t) approaches zero as the 

number of trials k increases. ^ 

5. Learning Adaptive Control 

Combining Results 1 and 2, it is possible to design a learning adaptive controller for 
the robotic system. It is desired now that the robot output y = C x follow y^ = C r x r in 

repeated trials. From Figure 1, it can be seen thatx m =x r + s , where s k =[s k ^', s k ^]T and 
r =q" r + s k . That is the dynamic equation of the reference model can be written as 

x "m = A r x m + B r r (14) 

7m ~ ^r x m • 

Now, we have the following result. 

Result 3 

Consider the robot manipulator given by the dynamic equation (2), and the linear, 
time-invariant, stable, and controllable reference model given by the operator L as in 
equation (14). Let the learning adaptive controller be given by 

u = K e e + K x x m + K r r + z ( 15 ) 


with the adaptation law 

K‘ e = oq E P e ej + ot -2 d/dt(E P e ej) 

K‘ x = E P e x m T + ^2 d/dt(E P e x m T ) 

K' r = E P e r T + S 2 d/dt ( E p e r T ) 

z' = r)iEP e + r|2 d/dt(E P e) 

where the state error e = x m - x is the error between the system and the model states. Also 
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sk+l(t) = sk(t) + akvi c (t) 


(16) 


with sg(t)=q H r (t) which is generated by reference dynamics equation (12), and 

a k = <e k (t), LL* e k (0> / l|LL* e^Oll 2 
and also 

v(k) = L* e k (t) 

with zero initial values, such that LL* is positive real, C r =C=[Cj,C2], and efc(t) = yr(t) - yfc(t) 
is the error function between the desired output trajectoiy y^. and the robot's output 
trajectory y in the k-th trial. The above learning adaptive controller is convergent in the 
sense that starting from the same initial state xg, the output error e^(t), and equivalently 
e(t) and e'(t), , approach zero over the interval [0,T] as the number of trials increases. 


Proof 


The proof is merely a combination of the proofs of Results 1 and 2, and is found in [6]. □ 

6. Conclusions 

A new adaptive control is proposed, that is capable of improving its tracking 
performance in repetitive motions. The design can be applied to a class of nonlinear 
systems that includes robotic manipulators. The controller guarantees that the error 
between a desired trajectoiy and the robot's trajectoiy approaches zero as the number of 
trials increases. The proposed controller does not require any knowledge of the dynamic 
parameters of the robot and can be easily implemented. 
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